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The MAILING DATE of this communication appears on the cover sheet with the correspondence address 
Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1 .1 36(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

- If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 
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2a)D This action is FINAL. 2b)S This action is non-final. 

3) D Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1935 CD. 1 1 , 453 O.G. 213. 
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DETAILED ACTION 



Response to Applicant's Election with Traverse 



Applicant's election with traverse of claims 1-9 in Paper No. 4 is acknowledged. 
The traversal is on the ground(s) that the non-elected claims share some common 
features. This is not found persuasive because: (1 ) The Applicant does not clearly point 
out as to what the common features are; and (2) The Applicant did not address all the 
items that were used as the basis for restriction. For instance, two inventions sharing 
some common feature could still gain separate use in the art. In addition, combination 
and subcombinations can be separately used. More importantly, if the common features 
are not identified it is difficult to difficult what constitutes the crux of the invention. Also, 
an issue may be raised as to how do the features other than the common features 
related the invention. 

The requirement is still deemed proper and is therefore made FINAL. 

Claims 10-18 are withdrawn from further consideration pursuant to 37 CFR 
1.142(b), as being drawn to a nonelected inventions, there being no allowable generic 
or linking claim. Applicant timely traversed the restriction (election) requirement in Paper 
No. 2 

Claims 1-9 are pending in this Office Action. 



Priority 

Receipt is acknowledged of papers submitted under 35 U.S.C. 1 19(a)-(d), which 
papers have been placed of record in the file. 
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Abstract 



The Applicant is requested to revise the abstract to more appropriately reflect the 
invention as claimed. 



Claim 8 is rejected under 35 U.S.C. 112, second paragraph, as being indefinite 
for failing to particularly point out and distinctly claim the subject matter which applicant 
regards as the invention. 

Claim 8. An apparatus according to claim 7, wherein said property information 
includes a writer of the file received in response to the search request, a document title, 
and the like. 

Regarding claim 8, the phrase "or the like" renders the claim(s) indefinite 
because the claim(s) include(s) elements not actually disclosed (those encompassed by 
"or the like"), thereby rendering the scope of the claim(s) unascertainable. See MPEP 
§ 2173.05(d). 

Claim Rejections - 35 USC § 102 

The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 



Claim Rejections - 35 USC §112 



States. 
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Claims 1-2, 4-5, and 9 are rejected under 35 U.S.C. 102(b) as being anticipated 
by the publication, "WebMate: A Personal Agent for browsing and Searching," Chen et 
al., Proceedings of the 2 nd International Conference on Autonomous Agents, May, 1998, 
NY, USA, ACM Press, pages 132-139, hereinafter "WebMate." 

As to Claim 1 , WebMate teaches, a document information search apparatus for 
searching document information on the basis of a search request transmitted through a 
network (Page 134, Fig. 1; search requests are made in the WWW) and responding, 
wherein: 

a search condition designating unit which designates a file as a search condition 
(page 134, col. 2, lines 4; page 137, col. 2, lines 13-18 ("the context of the search 
keywords in the relevant web pages is used"); a user designates a URL); 
and 

transmits contents of said designated file via the network is provided for a search 
requesting source (page 134; Fig. 1 ; WebMate receives the web page designated by a 
user); and 

a document search unit which forms a keyword from the file contents transmitted 
from said search condition designating unit (page 134, col. 2, the 2 nd paragraph; 
WebMate constructs a query based on a current profile which is formed of the keywords 
that come from a plurality of domains including the Web page visited by users when the 
users designate them; the creation of a personal profile is described in page 133, col. 1- 
2, section 3.1) and 
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searches similar documents from a database (page 134, col. 2; WebMate 
calculates similarity between the profile and a plurality of Web pages, and recommends 
the ones based on a threshold; note that WebMate searches a plurality of URL's of 
users do not designate any particular Web page or URL) is provided on a search side. 



As to claim 2, WebMate teaches an apparatus according to claim 1 , wherein said 
search condition designating unit transmits a head file portion of the designated file 
contents (page 137, lines 13-18; since the designated file is a Web page, the URL 
associated with the designated Web page is considered the head file) 



Claim 4 recites the following: 

an apparatus according to claim 1, wherein index information describing a list of 
important words extracted from search target documents is stored every document in 
said database, and said document search unit on the search side comprises: a text 
extraction processing unit which extracts a text document from the file contents received 
in response to the search request; a morpheme analyzing unit which extracts nouns by 
a morpheme analysis of said .text document; a keyword forming unit which extracts 
important words from said nouns and forms a keyword in which said important words 
are coupled by OR; and a search executing unit which searches similar documents by 
searching the search database by said keyword and notifies the search requesting 
source of a search result 



With respect to the limitations of claim 4, WebMate teaches creation of profiles 
and generation of relevant Web pages by extracting keywords from the relevant Web 
pages by using TF-IDF (term frequency - inverse document frequency) method. The 
TF-IDF requires that all documents be parsed for extracting keywords including nouns, 
and excluding the stop words, and also requires that documents be ranked in a 
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particular order. As to the step of notifying the search, see page 137, col. 2, and page 
138, col. 1 , wherein a list of 5 relevant documents is provided. 

As to claim 5 (an apparatus according to claim 4, wherein said keyword forming 
unit counts the number of times of appearance showing in which documents in the 
index of each of the search documents stored in said document database each of said 
nouns appears, selects a predetermined number of upper words each having the 
number of times of appearance in a predetermined range, and forms the keyword), 
WebMate teaches the use of TF/IDF method and in addition, teaches the use of "top 5 
words" in documents for retrieval of the most relevant documents (page 138, col. 1 , line 
15-17). 

As to, claim 9 (an apparatus according to claim 1, wherein said search condition 
designating unit of said search requesting source is provided by a WWW browser of a 
client, transmits the contents of the file designated by a search request picture plane of 
said WWW browser to a search machine of a WWW server through the network, and 
sends said file contents to said document search unit), WebMate shows the WWW 
environment in page 134, Figure 1. In accordance with the description provided on page 
7, lines 18-22 of the Applicant's Disclosure, it appears that the "search request picture 
plane" is nothing more than a query box where a keyword can be typed in by a user. 
Since WebMate teaches a browser, it inherently teaches the query box and/or search 
request picture plane as claimed . 
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Claim Rejections - 35 USC § 103 



Claim 6 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
WebMate. 

Claim 6 (an apparatus according to claim 5, wherein in the case where the 
number of documents in the index is assumed to be (N), said keyword forming unit 
selects upper ten words each having the number (H) of times of appearance in a range 
where 2N/3.gtoreq.H.gtoreq.1 and forms the keyword) requires that top 10 keywords be 
used to rank and present most relevant documents. WebMate suggests that top 5 
keywords be used (page 138, col. 1, line 15-17). 

It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to select top 10 instead of top 5 because such a change can be 
adopted without reconfiguring the WebMate. system or without incurring any 
reconfiguration overhead, while a person of ordinary skill in the art would find this as an 
added flexibility of the system. 

Claims 7 and 8 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
WebMate as applied to claims 1 , 4, and 5 above, and further in view of the publication, 
"CiteSeer: An Autonomous Web Agent for Automatic Retrieval and identification of 
Interesting Publications," by Bollacker et al., proceedings of the International 
Conference on Autonomous Agents, May 1998, ACM Press, pages 1 16-123, hereinafter 
"Citeseer." 
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As to claim 7 (an apparatus according to claim 5, wherein said keyword forming 
unit allows property information extracted from the file received in response to the 
search request to be included in said keyword, thereby allowing the similar documents 
to be searched), WebMate discloses the extraction of keywords (WebMate teaches that 
a user can provide any URLs that he would like to be the information sources and that 
the chosen URL may be used to expand the search in page 134, col. 2, lines 4-6), but 
does not explicitly indicate that the keywords include property information as claimed 

As to claim 8 (an apparatus according to claim 7, wherein said property 
information includes a writer of the file received in response to the search request, a 
document title, and the like), WebMate discloses the extraction of keywords, but does 
not explicitly indicate that the keywords extracted include the writer of the file or the title 
of the file. 

As to claims 7 and 8, Citeseer uses a sub-agent to search a plurality of Web 
pages when a broad keyword is entered by a user in the search query (page 118, col. 1 , 
section 3.13, lines 5-6). Citeseer further teaches the extraction of title and author in 
response to submitted query (page 118, col. 2, the bottom paragraph). 

It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to combine WebMate and Citeseer to make the system user- 
friendlier as such the user will be able to see the bibliographic information of to-be- 
retrieved documents. It would have been obvious to a person of ordinary skill in the art 
at the time the invention was made to combine WebMate and Citeseer to eliminate 
some of the retrieval candidates (i.e., to-be-retrieved documents) to avoid the down- 
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loaning and transmission overhead. It is general knowledge available to one of ordinary 
skill in the data processing art that a document is more likely to have bibliographic 
information and that a retrieval system incurs overhead to download a document. 

Claim 3 is rejected under 35 U.S.C. 103(a) as being unpatentable over WebMate 
as applied to claim 1 above, and further in view of U. S. Patent No. 6,182,085 issued to 
Eichstaedt ("the '085 patent"). 

As to claim 3 (an apparatus according to claim 1, wherein said search condition 
designating unit allows an HTML file and an Excel file to be included in the file which is 
designated as said search condition), WebMate teaches the parsing of an HTML page 
(page 134, col. 1 , section 3.2. lines 6), however does not explicitly indicate the 
processing of an EXCEL file. 

As to the limitation, "...said search condition designating unit allows an HTML file 
and an Excel file to be included in the file which is designated as said search condition", 
WebMate does not explicitly indicate that it is capable of parsing a EXCEL file submitted 
a query. 

With respect to claim 3, the '085 patent (Eichstaedt et al.) t in col. 5, lines 12-32, 
teaches: 

One example of a Gatherer 302 communicatively linked to a web 304 is pictured in FIG. 
3 and has a number of components. The web 304 may comprise an Internet, an intranet, 
or a single information source including media or multimedia objects. The Gatherer 
302 may include a Crawler 306 component that crawls media sources and retrieves 
objects while a Recognizer 308 component tries to determine the format for each of 
the retrieved objects. A Summarizer 310 component contains specialized codes that 
enable it to read a great number of different object formats such as a Freelance 
graphics presentation, an HTML page, a Lotus Notes database, or an Excel spreadsheet . 
It also provides a flexible structure for plugging in customized summarization codes 
to be used for summarizing data from a specific location. Compressed files included 
in a ZIP, TAR or JAR file are first extracted out by an Expander 312 component and 
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then processed by the Summarizer 310. A Gatherer may also carry an embedded 
HTTP server (not shown) so that system administrators can use a web-browser to 
control its operations and monitor its status. 

Therefore, it would have been obvious to a person of ordinary skill in the art at 
the time the invention was made to combine WebMate and the '085 patent to make the 
system more user-friendly as such the user would be to submit queries in any form and 
would not have to convert the query object into a specific format to search for relevant 
information. 



Prior Art 

The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. 

"Phrasier: A System for Interactive Document Retrieval Using Keyphrases," by 
Jones et al., Proceedings of the 22 nd Annual International CM SIGIR Conference on 
Research and Development in Information Retrieval, August, 1999, pages 160-167, 
hereinafter, "Phrasier." Fig. 1 is a user interface; it shows three panes: one for selecting 
text or key phrases, another one for search results and yet another one for displaying a 
particular document obtained in the result list; upper left pane and the middle pane in 
the background are used for designating text or a file from which queries are formed; 
see page 161 , col. 2, at the bottom - "user enters a text or it is read from a file ") 



U. S. Patent No. 5,983,221 issued to Christy 
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Contact Information 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Hosain T. Alam whose telephone number is (703) 308- 
6662. The examiner can normally be reached on Monday-Thursday. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 

supervisor, Kim Y Vu can be reached on (703) 305-4393. The fax phone numbers for 

the organization where this application or proceeding is assigned are (703) 308-6606 for 

regular communications and (703) 308-6606 for After Final communications. 

Any inquiry of a general nature or relating to the status of this application, or proceeding 
should be directed to the receptionist whose telephone number is (703) 305 3800. 

The following contact numbers may also be used: 

TC 21 00 After Finals number is 703-746-7238 

TC 2100 Official Fax number is 703-746-7239 

TC 2100 Customer Service Center is 703-746-7240 




Hosain T Alam 
Primary Examiner 
Art Unit 21 72 
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